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CLAIMS 



What is claimed is: 



11. A method of operation on one or more data processing machines, the 

2 method comprising: 

3 determining a first collection rating for a first rating scale for contents of a first 

4 document collection; 

5 determining a first link rating for said first rating scale for contents linked to or 
g 6 linked by contents of said first document collection; and 

/| 7 modifying said first collection rating for said first rating scale for contents of 

8 said first document collection based on said determined first link rating for said first 

Jfj! 9 rating scale for contents linked to or linked by contents of said first document 

W10 collection. 



Q 1 2. The method of claim 1 , wherein said determining of a first collection rating 

jp 2 comprises determining said first collection rating based on document ratings of a 

3 first subset of documents of said first collection of documents, and sizes of the 

4 documents of the first subset of documents of the first document collection. 



1 3. The method of claim 2, wherein said first subset of documents of said first 

2 document collection consists of first textual documents of said first document 

3 collection. 



1 4. The method of claim 1 , wherein said determining of a first link rating 

2 comprises determining at least a second collection rating for at least a second 
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3 document collection with documents linked to or linked by documents of said first 

4 document collection, and determining said first link rating based on said determined 

5 at least a second collection rating of said at least a second document collection. 



1 5. The method of claim 1 , wherein said modifying of the first collection rating 

2 comprises replacing the determined first collection rating with said determined first 

3 link rating. 



1 6. The method of claim 1 , wherein said modifying of the first collection rating 
q 2 comprises adding said determined first link rating to the determined first collection 

1 3 rating. 

fll 

Pj! 1 7. The method of claim 1 , wherein said modifying of the first collection rating 

^ 2 comprises subtracting said determined first link rating from the determined first 

If 3 collection rating. 

S3 
m 

p 1 8. The method of claim 1 , wherein said first document collection is a web site, 

2 and said contents of said first document collection are web pages. 



1 9. A method of operation on one or more data processing machines, the 

2 method comprising: 

3 determining document ratings for a rating scale for a subset of documents of 

4 a document collection; 

5 determining sizes of the documents of said subset; 
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6 determining a collection rating for said rating scale for said document 

7 collection based on said determined document ratings of said subset of documents, 

8 and normalized by said determined sizes of said subset of documents. 

1 10. The method of claim 9, wherein said determining of the collection rating 

2 comprises further subdividing said subset of documents into a plurality of groups in 

3 accordance with their determined sizes, and applying a weight to the document 

4 rating determined for said rating scale for each document of the subset in 

5 accordance to the document's size group classification. 



m i 

ill 2 

p. 

m 



1 1 . The method of claim 10, wherein weights are applied to said determined 



m 
It 



3 
1 
2 
3 
4 
5 



Document size range in (bytes) 


Weight 


<500 


1 


500 - 999 


4 


1000-4999 


7 


5000 - 9999 


10 


>9999 


13 



12. The method of claim 9, wherein said determining of the collection rating 
comprises further subdividing said subset of documents into a plurality of groups in 
accordance with their determined ratings for said rating scale, and applying a weight 
to the document rating determined for said rating scale for each document of the 
subset in accordance to the document's rate group classification. 
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13. The method of claim 12, wherein weights are applied to said determined 
docu ment ratings for said rating scale as follows: 



Determined document ratina for 

L^\>kx>l 1 1 111 IwU UvvUI 1 Iwl 1 L 1 S-A III IU Ivl 


Weiaht 

V V vlU III 


said rating scale 




0 


-0.5 


1 


0.5 


2 


3 


3 


6 



14. The method of claim 9, wherein said determining of the collection rating 
comprises computing the collection rating for said rating scale as follows: 

Z^w y iog(7V, + i) 

where CR is the collection rating for said rating scale; 

n is the weight applied for document rating group /; 

Wj is the weight applied for document size group j\ 

Ny is the number of pages in the collection with document rating / and 

having group sizes j for said rating scale. 

1 5. The method of claim 9, wherein said first collection of documents are web 
pages of a web site, and said first subset of documents are textual documents of 
said web site. 

16. A method of operation on one or more data processing machines, the 
method comprising: 
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3 determining whether a first document collection comprises at least one 

4 document linked to at least one other document of at least one other second 

5 document collection; 

6 determining a collection rating for a rating scale for each of said at least one 

7 other second document collection if said first document collection is determined to 

8 comprise at least one document linked to at least one other document of at least 

9 one other second document collection; 

10 determining whether said first document collection comprises at least one 

1 1 document being linked by at least one other document of at least one other third 
^12 document collection; 

>;13 determining a collection rating for said rating scale for each of said at least 

■pp 

r;fl4 one other third document collection if said first document collection is determined to 

5 comprise at least one document linked by at least one other third document 

Pi 6 collection; and 

Pi 7 determining a link rating for said rating scale for said first document collection 

based on either said determined collection rating or ratings for said rating scale for 



Q19 said at least one other second document collection, or said determined collection 

I* 

20 rating or ratings for said rating scale for said at least one other third document 

21 collection, or both, depending on whether collection rating or ratings are determined 

22 for said rating scale for said at least one other second document collection, said at 

23 least one other third document collection or both. 

1 1 7. The method of claim 1 6, wherein each of said determining of a collection 

2 rating for said rating scale for each of said at least one other second or third 

3 document collection comprises determining document ratings for said rating scale 

4 for documents of the particular document collection, and sizes of the documents, 
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and determining the collection rating for the particular document collection based on 
the determined document ratings and the determined sizes. 

18. The method of claim 16, wherein said determining of a link rating comprises 
summing said collection rating or ratings determined for said rating scale for said at 
least one other second or third document collection, and determining the link rating 
based on the result of said summing. 

19. The method of claim 18, wherein said determining of the link rating based on 
the result of said summing comprises determining the link rating based on the result 
of said summing as follows: 



The result of said summing (RS) 


link rating 


RS less than -2 


- 1.0 


RS greater than or equal to -2, 
but less than - 1 


-0.5 


RS greater than or equal to -1 , 
but less than or equal to - 0.5 


0 


RS greater than -0.5, but less 
than or equal to 1.5 


0.5 


RS greater than 1.5, but less 
than or equal to 3 


1.0 


RS greater than 3, but less than 
or equal to 4 


1.5 


RS greater than 4 


2.0 
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1 20. An apparatus comprising: 

2 storage medium having stored therein a plurality of programming instructions 

3 designed to enable said apparatus to 

4 determine a first collection rating for a first rating scale for contents of a 

5 first document collection, 

6 determine a first link rating for said first rating scale for contents linked to 

7 or linked by contents of said first document collection, and 

8 modify said first collection rating for said first rating scale for contents of 

9 said first document collection based on said determined first link rating 
;P J0 for said first rating scale for contents linked to or linked by contents of 
"2l 1 said first document collection; and 

Wl2 at least one processor coupled to the storage medium to execute the 

jp'13 programming instructions. 

O 1 21 . The apparatus of claim 20, wherein said programming instructions are 

f3 

CI 2 designed to enable the apparatus to perform said determining of a first collection 

t* 

O 3 rating by determining said first collection rating based on document ratings of a first 



4 subset of documents of said first collection of documents, and sizes of the 

5 documents of the first subset of documents of the first document collection. 

1 22. The apparatus of claim 21 , wherein said first subset of documents of said first 

2 document collection consists of first textual documents of said first document 

3 collection. 

1 23. The apparatus of claim 20, wherein said programming instructions are 

2 designed to enable the apparatus to perform said determining of a first link rating by 
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3 determining at least a second collection rating for at least a second document 

4 collection with documents linked to or linked by documents of said first document 

5 collection, and determining said first link rating based on said determined at least a 

6 second collection rating of said at least a second document collection. 

1 24. The apparatus of claim 20, wherein said programming instructions are 

2 designed to enable the apparatus to perform said modifying of the first collection 

3 rating by replacing the determined first collection rating with said determined first link 

4 rating. 

O 

|jj 1 25. The apparatus of claim 20, wherein said programming instructions are 

W 2 designed to enable the apparatus to perform said modifying of the first collection 

jjjjj 3 rating by adding said determined first link rating to the determined first collection 

O 4 rating. 

% 

CI 1 26. The apparatus of claim 20, wherein said programming instructions are 

$* 

ip 2 designed to enable the apparatus to perform said modifying of the first collection 

3 rating by subtracting said determined first link rating from the determined first 

4 collection rating. 

1 27. The apparatus of claim 20, wherein said first document collection is a web 

2 site, and said contents of said first document collection are web pages. 

1 28. An apparatus comprising: 

2 storage medium having stored therein a plurality of programming instructions 

3 designed to enable said apparatus to 
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determine document ratings for a rating scale for a subset of documents 

of a document collection, 
determine sizes of the documents of said subset, 
determine a collection rating for said rating scale for said document 
collection based on said determined document ratings of said subset 
of documents, and normalized by said determined sizes of said subset 
of documents; and 
at least one processor coupled to the storage medium to execute the 
programming instructions. 

29. The apparatus of claim 28, wherein said programming instructions are 
designed to enable the apparatus to perform said determining of the collection rating 
by further subdividing said subset of documents into a plurality of groups in 
accordance with their determined sizes, and applying a weight to the document 
rating determined for said rating scale for each document of the subset in 
accordance to the document's size group classification. 

30. The apparatus of claim 29, wherein said programming instructions are 
designed to enable the apparatus to apply weights to said determined document 



Document size range in (bytes) 


Weight 


<500 


1 


500 - 999 


4 


1000-4999 


7 


5000 - 9999 


10 


>9999 


13 
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31 . The apparatus of claim 28, wherein said programming instructions are 
designed to enable the apparatus to perform said determining of the collection rating 
by further subdividing said subset of documents into a plurality of groups in 
accordance with their determined ratings for said rating scale, and applying a weight 
to the document rating determined for said rating scale for each document of the 
subset in accordance to the document's rate group classification. 



32. The apparatus of claim 31 , wherein said programming instructions are 
designed to enable the apparatus to apply weights to said determined document 



for said rating scale as follows: 


Determined document rating for 
said rating scale 


Weight 


0 


-0.5 


1 


0.5 


2 


3 


3 


6 



33. The apparatus of claim 28, wherein said programming instructions are 

designed to enable the apparatus to perform said determining of the collection rating 

by computing the collection rating for said rating scale as follows: 
Zr.^iogCAT. + i) 

_ _u 

" ZW;l°g(jV, + D 

u 

where CR is the collection rating for said rating scale; 

n is the weight applied for document rating group /; 
Wj is the weight applied for document size group /; 
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8 Ny is the number of pages in the collection with document rating / and 

9 having group sizes j for said rating scale. 



1 34. The apparatus of claim 28, wherein said first collection of documents are web 

2 pages of a web site, and said first subset of documents are textual documents of 

3 said web site. 



1 35. An apparatus comprising: 



2 storage medium having stored therein a plurality of programming instructions 
^ 3 designed to enable said apparatus to 

4 determine whether a first document collection comprises at least one 

P 5 document linked to at least one other document of at least one other 

|V 6 second document collection, 

a 

Q 7 determine a collection rating for a rating scale for each of said at least one 

Q 8 other second document collection if said first document collection is 

m 

19 9 determined to comprise at least one document linked to at least one 

CEO other document of at least one other second document collection, 

1 1 determine whether said first document collection comprises at least one 

12 document being linked by at least one other document of at least one 

13 other third document collection, 

14 determine a collection rating for said rating scale for each of said at least 

15 one other third document collection if said first document collection is 

16 determined to comprise at least one document linked by at least one 

17 other third document collection, and 

18 determine a link rating for said rating scale for said first document 

19 collection based on either said determined collection rating or ratings 
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20 for said rating scale for said at least one other second document 

21 collection, or said determined collection rating or ratings for said rating 

22 scale for said at least one other third document collection, or both, 

23 depending on whether collection rating or ratings are determined for 

24 said rating scale for said at least one other second document 

25 collection, said at least one other third document collection or both; 

26 and 

27 at least one processor coupled to the storage medium to execute the 



28 programming instructions. 

* T P 1 36. The apparatus of claim 35, wherein said programming instructions are 

W 2 designed to enable the apparatus to perform each of said determining of a collection 

W 3 rating for said rating scale for each of said at least one other second or third 

Q 4 document collection by determining document ratings for said rating scale for 

P 5 documents of the particular document collection, and sizes of the documents, and 

IS 

P 6 determining the collection rating for the particular document collection based on the 

•lU 

O 7 determined document ratings and the determined sizes. 

1 37. The apparatus of claim 35, wherein said programming instructions are 

2 designed to enable the apparatus to perform said determining of a link rating by 

3 summing said collection rating or ratings determined for said rating scale for said at 

4 least one other second or third document collection, and determining the link rating 

5 based on the result of said summing. 

1 38. The apparatus of claim 37, wherein said programming instructions are 

2 designed to enable the apparatus to perform said determining of the link rating 
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based on the result of said summing by determining the link rating based on the 
result of said summing as follows: 



The result of said summing (RS) 


link rating 


DC loco fhon 0 

r\o less xnan 


- I .u 


RS greater than or equal to -2, 
dul less man - i 


-0.5 


RS greater than or equal to -1 , 
out less tnan or equal 10 — u.o 


0 


DC nrootor than 0 ^ Hi it loo c 
r\0 yitJdlci Uldll — U.v), UUl Icoo 

than or equal to 1.5 




RS greater than 1 .5, but less 
than or equal to 3 


1.0 


RS greater than 3, but less than 
or equal to 4 


1.5 


RS greater than 4 


2.0 
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